Quantify spatial relations to discover handwritten graphical symbols
نویسندگان
چکیده
To model a handwritten graphical language, spatial relations describe how the strokes are positioned in the 2-dimensional space. Most of existing handwriting recognition systems make use of some predefined spatial relations. However, considering a complex graphical language, it is hard to express manually all the spatial relations. Another possibility would be to use a clustering technique to discover the spatial relations. In this paper, we discuss how to create a relational graph between strokes (nodes) labeled with graphemes in a graphical language. Then we vectorize spatial relations (edges) for clustering and quantization. As the targeted application, we extract the repetitive sub-graphs (graphical symbols) composed of graphemes and learned spatial relations. On two handwriting databases, a simple mathematical expression database and a complex flowchart database, the unsupervised spatial relations outperform the predefined spatial relations. In addition, we visualize the frequent patterns on two text-lines containing Chinese characters.
منابع مشابه
Structural analysis of online handwritten mathematical symbols based on support vector machines
Mathematical expression recognition is still a very challenging task for the research community mainly because of the two-dimensional (2d) structure of mathematical expressions (MEs). In this paper, we present a novel approach for the structural analysis between two on-line handwritten mathematical symbols of a ME, based on spatial features of the symbols. We introduce six features to represent...
متن کاملFirst experiments on a new online handwritten flowchart database
We propose in this paper a new online handwritten flowchart database and perform some first experiments to have a baseline benchmark on this dataset. The collected database consists of 78 flowcharts labeled at the stroke and symbol levels. In addition, an isolated database of graphical and text symbols was extracted from these collected flowcharts. Then, we tackle the problem of online handwrit...
متن کاملCombining Structural and Statistical Approach to Online Recognition of Handwritten Mathematical Formulas
This paper introduces a novel method for online recognition of handwritten mathematical formulas. The method is based on the combination of a structural analysis with a statistical model specifying relations of individual symbols. A description of all recognition phases is given, focusing mostly on the structural analysis stage. The recognition process following a bottom-up manner is driven by ...
متن کامل